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environments where agents learn over time. The test is used to study the 
working hypothesis that, because firms have only limited information about 
the quality of workers in the early stages of their careers, they distinguish 
among workers on the basis of easily observable variables that are correlated 
with productivity, such as years of education or degree, the quality of the 
school the person attended, race, and gender. The hypothesis was investigated 
by using panel data on education, the Armed Forces Qualification Test (AFQT) , 
father's education, and wages for young men and their siblings from the 
National Longitudinal Survey of Youth. The empirical implications of 
statistical discrimination on the basis of race were also examined. The 
analysis results supported the hypothesis of statistical discrimination; 
however, they were inconsistent with the hypothesis that firms fully utilize 
the information on race. (Thirty-eight references and 10 tables are included. 
Appended are the standard deviations, minimum and maximums of the variables 
used in the analysis and data on the relationship among wages, schooling, 
AFQT, and parental education.) (MN) 
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Employer Learning and Statistical Discrimination 

Abstract 

We provide a test for statistical discrimination or “rational” stereotyping in environments in which agents 
learn over time. Our application is to the labor market. If profit maximizing firms have limited 
information about the general productivity of new workers, they may choose to use easily observable 
characteristics such as years of education to "statistically discriminate" among workers. As firms acquire 
more information about a worker, pay will become more dependent on actual productivity and less 
dependent on easily observable characteristics or credentials that predict productivity. Consider a wage 
equation that contains both the interaction between experience and a hard to observe variable that is 
positively related to productivity and the interaction between experience and a variable that firms can 
easily observe, such as years of education. We show that the wage coefficient on the unobservable 
productivity variable should rise with time in the labor market and the wage coefficient on education 
should fall. We investigate this proposition using panel data on education, the AFQT test, father's 
education, and wages for young men and their siblings from NLSY. We also examine the empirical 
implications of statistical discrimination on the basis of race. Our results support the hypothesis of 
statistical discrimination, although they are inconsistent with the hypothesis that firms fully utilize the 
information in race. Our analysis has wide implications for the analysis of the determinants of wage 
growth and productivity and the analysis of statistical discrimination in the labor market and elsewhere. 
JEL Classification: D83, J31 
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1. Introduction 



People go through life making an endless stream of judgments on the basis of limited information 
about matters as diverse as the safety of a street, the quality of a car, the suitability of a potential spouse, 
and the skill and integrity of a politician. When hiring, employers must assess the value of potential 
workers with only the information contained in resumes, recommendations, and personal interviews. What 
do employers know about the productivity of young workers, and how quickly do they learn? Given lack 
of information about actual productivity, do employers "statistically discriminate" among young workers 
on the basis of easily observable variables such as education, race, and other clues to a worker's labor force 
preparation. Many issues in labor economics hinge on the answers, including the empirical relevance of the 
signaling model of education (Weiss (1995), statistical theories of discrimination (Aigner and Cain (1977), 
Lundberg and Startz (1983)), and the interpretation of earnings dynamics. The desirability of changes in 
the laws governing hiring procedures, evaluation of employees, layoff and firing costs, and the provision of 
references for former employees also hinge on the answers. Although labor economists typically assume 
wages are strongly influenced by employer beliefs about worker productivity, there is little empirical 
research on how much employers know about their workers, or about how this information changes with 
time in the labor market. 1 

In this paper we explore the implications of a hypothesis that we refer to as Statistical 
Discrimination with Employer Learning, or SD-EL. Our working hypothesis is that firms have only 
limited information about the quality of workers in the early stages of their careers. They distinguish 
among workers on the basis of easily observable variables that are correlated with productivity such as 
years of education or degree, the quality of the school the person attended, race, and gender. (To avoid 
misunderstanding we wish to stress that part of the relationship between wages and race and gender may 
reflect biased inferences on the part of employers or other forms of discrimination that have nothing to do 
with productivity or information.) Firms weigh this information with other information about outside 
activities, work experience to date, references, the job interview, and perhaps formal testing by the firm. 
Each period, the firm observes noisy indicators of the worker's performance. Over time, these make the 
information observed at the start redundant. Wages become more closely tied to actual productivity and 
less strongly dependent upon the information that was readily available at the beginning of a worker's 



'There is a large empirical and theoretical literature on labor market search and on the effects of learning about the quality of 
the job match on wages and mobility. See Devine and Kiefer (1991) for a comprehensive survey. 



career. The main contribution of the paper is to provide a way to test for whether firms statistically 
discriminate on the basis of readily available information such as education and race. We also provide a 
way to estimate the learning profile of firms and address the issue of whether firms have a stable view of 
the productivity of workers with many years of labor market experience. 

Our research builds on some previous work, particularly Farber and Gibbons (1996). 2 Farber and 
Gibbons investigate three implications of employer learning. Imagine a variable s (say schooling) which 
firms can observe directly and a second variable, z (say AFQT test scores or sibling's wage rate) which 
firms cannot observe directly. They show first that employer learning does not imply that the coefficient 
on s in a wage regression will change with experience. This is because future observations, on average, 
simply validate the relationship between expected productivity and s for new entrants. Their empirical 
evidence is generally supportive of this result, although they note that a positive interaction could arise if 
schooling is complementary with training. (Positive interactions are found in a number of data sets, 
including the PSID.) Second, they show that the part of z that is orthogonal to information available to 
employers at the beginning of a worker's careers will have an increasingly large association with wages as 
time passes. Third, they note that wage growth will be a Martingale process, at least in the case in which 
productivity of the worker is constant. 

In this paper we focus on a different but related proposition that allows us to examine the issue of 
statistical discrimination. The proposition concerns how controlling for the experience profile of the 
effect of z on wages alters the interaction between experience and s. We show that not only should the 
coefficient on z rise with time in the labor market, but the coefficient on s should fall. We investigate these 
propositions using data on young men from the NLSY. We also explore the implications of statistical 



2 Other relevant references are Gibbons and Katz (1991) which we discuss below and Parsons (1993). Glaezer (1992) uses 
variances in wage innovations as a measure of learning. His work is somewhat closely related to Farber and Gibbons. 

However, he attempts to distinguish between information that is specific to the job match and information about general 
productivity. Foster and Rosenzweig (1993) use data on piece rate and time-rate workers to investigate several implications of 
imperfect information on the part of employers that are different from the one studied here. Their results imply that the 
incompleteness of employer information is an important issue. Studies following performance evaluations within firms based 
on the EOPP data, or studies using firm personnel files (Medoff and Abraham (1980)) are also relevant, but have a very 
different focus than the present paper. Parsons (1986), Weiss (1995) and Carmichael (1989) provide useful discussions of 
some of the theoretical issues on the link between wages and employer perceptions about productivity. Albrecht (1982) 
conducts a test of screening models of education based on the idea that education will have less impact on the probability a 
worker will be hired if the worker was referred to the firm by another worker because some of the information contained in 
education will be transmitted through the referral. Montgomery (1991) presents a model in which employers obtain valuable 
information on the productivity of new employers through referrals and is part of a large literature on labor market networks. 
For empirical evidence see Holzer (1988). 
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to observe background variables that influence productivity. 3 While our basic theoretical framework and 
most of the empirical analysis assumes that all employers have the same information about workers, we 
provide a preliminary discussion of the implications of models in which the current employer has an 
information advantage. 

In Section 2 we present our basic theoretical framework in a setting in which information is public, 
and then informally discuss the case in which it is private. We also consider the effect that associations 
between s, z, and job training would have on the analysis. In Section 3 we discuss the NLSY data used in 
the study. In Section 4 and 5 we present our results for education and race. In section 6 we present results 
in which we control for job training. In section 7 we discuss the case in which employer information is 
private and provide some evidence on how hard to observe variables are related to the probability of a 
layoff and the wage losses associated with layoffs. In section 8 we point out that interpreting our 
estimates of the time profile of the effect of AFQT on wages as the result of employer learning implies that 
high ability workers would have a substantial financial incentive to take the AFQT to differentiate 
themselves from those who are less able in this dimension. The fact that we do not generally observe this 
raises additional research questions. In section 9 we close the paper with a discussion of some of the 
implications of our analysis for a number of standard topics in labor economics and a research agenda. 



2. Implications of Statistical Discrimination and Employer Learning for Wages. 

2. 1 A Model of Employer Learning and Wages 

In this section we show how the wage coefficients on characteristics that employers can observe 
directly and on characteristics they cannot observe directly will change with experience if employers 
statistically discriminate and become better informed about workers over time. 



3 We are using the term "statistical discrimination" as synonymous with the use of the term “rational expectations” in the 
economics literature. We mean that in the absence of full information, firms distinguish between individuals with different 
characteristics based on statistical regularities. In other words, we mean that firms form stereotypes that are rational in the 
sense that they are consistent with reality. Many papers that use the term statistical discrimination analyze race or gender 
differentials that arise because firms have trouble processing the information they receive about the performance of minority 
group members. This difficulty may lead to negative outcomes for minorities because it lowers their incentives to make 
unobservable investments that raise productivity. It also may lead to negative outcomes if the productivity of a job match 
depends on the fit between the worker and the job. Some papers also consider whether firms that start with incorrect beliefs 
about the relationship between personal characteristics and productivity (inaccurate stereotypes) would correct them, and, in 
models with worker investment, whether the priors held by firms may be self fulfilling. See Aigner and Cain (1977), Lundberg 
and Startz (1983), Lang (1986), and Coate and Loury (1993) and Oettinger (1996). In Oettinger’s model productivity is match 
specific and productivity signals are noisier for blacks than whites. As a result the sorting process across job changes is less 
efficient for blacks, and a race gap develops over time. 
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Our model is very similar to Farber and Gibbons (1996). Let y it be the log of labor market 
productivity of worker i with t ( years of experience. y it is determined by 

(1) Yu = rs ( + H(t ; ) + a,qi + ^ 

where s ( is years of schooling, z x is a vector of correlates of productivity that are not observed directly by 
employers but are available to the econometrician, and H(tj) is the experience profile of productivity. The 
variable t|j consists of other determinants of productivity and is not directly observed by the employers or 
the econometrician. The elements of Zj might be a test score, the income of an older sibling, father's 
education, or indicators of childhood environment such as books in the home or ownership of a library 
card. We normalize z ; so that all the elements of the conformable coefficient vector A are positive. 

Without loss of generality we scale T| { so that it has a unit coefficient in the productivity equation. 

In addition to s i; the employer observes a vector q of other information about the worker that is 
relevant to productivity. The elements of q s are related to productivity by the coefficient vector a, For 
now we assume that the experience profile of productivity does not depend on s ( , z ( , q„ or q r In section 2.2 
we discuss the sensitivity of our analysis to this assumption. In most of the analysis we suppress the i 
subscript. All variables are expressed as deviations from population means. Although we use years of 
schooling and race as our examples of s, our analysis applies to any variable that employers can easily 
observe. 

We assume that the conditional expectations of E(z|s,q) and E(q|s,q) are linear in q and s, so 

(2) z = E(z|s,q) + v = y, q + y 2 s + v 

(3) q = E(q|s,q) + e = a 2 s + e , 

where the vector v and the scalar e have mean 0 and are uncorrelated with q and s by definition of an 
expectation. 4 The links from s to z and q may be partially due to a causal effect of s. 5 Equations (1), (2) 
and (3) imply that Av + e is the error in the employer's belief about the log of productivity of the worker at 



4 The exclusion of q from the conditional mean of r) is innocuous, since we are simply defining r) and the coefficient vector a, 
on q in ( 1 ) so that the mean of r\ does not depend on q. 

5 For example, below we use the Armed Forces Qualification Test (AFQT) as z and years of education as s, and Neal and 
Johnson (1996) present evidence that years of education have a sizable positive effect on AFQT. 
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the time the worker enters the labor market. The sum Av+e is uncorrelated with s and q. We make the 
additional assumption that Av +e is independent of q and s. 

Each period that a worker is in the labor market, firms observe a noisy signal of the productivity of 
the worker, 

(4) i t = y + e t 

where y is y t - H(t) and e t reflects transitory variation in the performance of worker i and the effects of 
variation in the firm environment that are hard for the firm to control for in evaluating the worker. (We 
continue to suppress the i subscripts.) The term e t is assumed to be independent of the other variables in 
the model. We are also implicitly assuming that the component of e, that reflects temporal variation in 
productivity from sources specific to worker i is serially uncorrelated, because otherwise firms would have 
an incentive to base compensation in t+1 on what they know about the worker specific component of e,. 6 
However, e, may be serially correlated as a result of the other factors. 

Since the employers know q and s, observing £ t is equivalent to observing 

(5) d t =Av + e + e, = £ t -E(y|s,q) 

The vector D t ={d„d 2 ,. summarizes the worker's performance history. Let p, be the difference 
between Av+e and E(Av+e|D,). By definition p, is uncorrelated with D„ q and s but in addition we assume 

p, is distributed independently of D„ q and s. 

We also assume for now that q, s, and the worker's performance history (summarized by the vector 
D ={d„d 2 ...d t } are known to all employers, as in Farber and Gibbons. (We discuss the private information 
case in Section 7.) As a result of competition among firms, the worker receives a wage W, equal to the 
expected value of productivity Y t (Y t = exp(y,)) times the multiplicative error component exp(^ t ) that 
reflects measurement error and firm specific factors that are outside the model and are unrelated to s, z, and 

q. The wage model is 

(6) W t = E(Yt|s,q,D t )e f ' 

Using (1), (2), (3) and (6) leads to 



6 The firm’s knowledge of a serially correlated productivity component would imply serially correlated transitory variation in 
the wage error of the type found by Farber and Gibbons (1996), but would not have much effect on our analysis. 
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(7) W, = E(Y,|s,q,D,)e ?1 = e ra+H(,) e (ai+Ay ' )q+(ai+A, ' j)s e E(Av+e|D ' ) E( e "-) e f ’ 

Taking logs and collecting terms leads to 

(8 ) w, = (r + Ay 2 + a 2 )s + H*(t) + (Ay, + a,)q + E(Av+e |Dt) + <;, 

where w t = log(W,) and H*(t) = H(t) + log(E(e^)) • We will suppress the <;, term in the equations that 
follow. 

In the context of the debate over signaling models of education Riley (1979) and others have noted 
that unless the relationship between schooling and actual productivity changes, the coefficient on s will not 
change. This is true regardless of why s is related to productivity. Farber and Gibbons also make this 
point by showing in a similar model that the expected value of the coefficient of an OLS regression of w t 
on s does not depend on t. They estimate an equation of the form 

(8a) w t = b st s + H(t) + a,q + E(Av + e|D,) 

with q treated as an error component. They find that b st does not depend much on t. 

Farber and Gibbons also make a second point, which is that if one adds the component z of (Av + e) 
that is uncorrelated with the employer's initial information s and q to the wage equation and estimates 

(8b) w t = b st s + b^'H- H(t) + a,q + E(Av + e|D,), 

the coefficients on s do not depend on t. This follows almost immediately from the first result, because 
adding a second variable to a regression model has no effect on the expected value of the first if the two are 
uncorrelated. They provide evidence from NLSY that b st is relatively constant and b z , t is increasing in t. 

In this paper, we establish a related set of results that permit one to examine the issue of statistical 
discrimination. We begin with the case in which z and s are scalars and then consider the more general 
cases. Among those who are working the means of q, s, and z may depend on t although this will influence 
estimates of H*(t). However, we assume throughout that among those who are working the covariances 
among q, s, and z do not depend on t. Under these assumptions the variances and covariances involving q, 
s, and z and the regression coefficients O qs and <J> qz defined in (10) below do not vary with t. 7 



7 Estimates of the experience profile H’(t) will be affected if the means of s, q, and z depend on t but this has no bearing on our 
analysis. 
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Case 1 : z is a scalar. 



The analysis is cleanest when z and s are scalars. Least squares regression will identify the 
parameters of the expectation of w t on s, z, and experience profile H*(t). 8 Let b st and b zt be the coefficients 
on s and z in the conditional expectation function when t=0. . .T, with 

(9) E(w t |s.z.t)= b st s + b^z. + H*(t) . 

When the individual starts work (t is 0) this equation is 

(9a) E(w 0 |s.z.0)= b s0 s + b^z + H*(0) 

To simplify the algebra but without any additional assumptions we re-interpret s, z, and q as components 
of s, z, and q that are orthogonal to H*. Then the wage process (7) , the fact that E(Av+e|D 0 ) is 0 (since 
there is no work history when t=0), and some straightforward algebra involving the least squares regression 
omitted bias formula implies that 
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where $ qs and $ qz are the coefficients of the auxiliary regression of (a, + Ay,)q on s and z.. The 
parameters {b st , b zt }' are the sum of the {b s0 , b^}’ and the coefficients of the regression of E(Av+e|D,) on s 
and z. That is, 
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where |Var(s,z)| is the determinant of Var(s,z) and we use the facts that cov(s, E(Av+e|D,))=0 and 
cov(z,E(Av+e|D,))=cov(v,E(Av+e|D l )). This may be rewritten as 
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(12a) b st = b s0 + 6 t 0> s 



8 Technically, it identifies the coefficients of the least squares linear projection of w t on s, z, and H*(t) if E(Av+e|D t ) is not 
linear in the functions of s, z, and t we introduce in our regression models. We ignore this distinction. 
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(12b) b„ = b„ + e,4>, 

where <$ s and O z are the coefficients of the regression of Av+e on s and z and 

0, = cov(E(Av+e|D,), z)/cov(Av+e, z)= cov(E(Av+e|D,),v)/cov(Av+e, v) 

is a parameter that is specific to the experience level t. Note that 0 t O s and 0 t O z are the coefficients of the 
regression of E(Av+e|D,) on s and z and that 0, summarizes how much the firm knows about Av + e at time 
t. It is easy to show (see Appendix 1) that <£ s = - O zs O z where is the coefficient of the regression ofz 
on s. (This is the basis of proposition 3 below.) 

To determine the behavior of 0,0 s and 0,0 z over time, note first that O s < 0 and O z > 0 if cov (v, 
Av+e) > 0 and cov(s,z) >0. The latter condition is true when s is schooling and the scalar z is AFQT, 
father's education, or the wage rate of an older sibling. The condition cov(v,Av+e) > 0 simply states that 
the unobserved (by the firm) productivity subcomponent v and composite unobserved productivity term 
Av+e have a positive covariance. This seems plausible to us for the z variables we consider. 

The change over time in b s , and b^is determined by 0,. Intuitively, 0, is bounded between 0 and 1. 

It is 0 in period 0, because in this period employers know nothing about Av + e, so E(Av+e|D 0 )=0. The 
coefficient is 1 if E(Av+e|D,) is Av+e, since in this case the employer has learned what Av+e is and thus 
knows productivity y . It is also intuitive that 0, is nondecreasing in t because the additional information 
that arrives as the worker's career progresses permits a tighter estimate of Av+e. 9 

The regularity conditions on the e, process that are required for the time average of e, to converge 
almost surely to 0 as t becomes large constitute sufficient conditions for 0,to converge to 1 as t becomes 
large. (See Theorem 3.47 in White (1984) for a very general set of conditions.) These conditions limit the 
degree of independence among the e, and also restrict the variances. The intuition for this is that future 
values of e, must be sufficiently independent of the earlier e's to average out, and must not be so variable 
that the future d, values have no new information about Av+e. 10 



9 To establish this note that since D,., is a subset of the information in D„ 
[Cov(v, E(Av+e|D t ) - E(Av+e|D,.,))]/Cov(v, Av+e) = 0,- 0 M * 0. 



10 To establish the result note that in each period, firms observe d, =Av + e + e, . In general, the form of E(Av+e|DJ will depend 
on the pattern of serial correlation and the relative variances of e,...e t . However, the firm can always choose to use E(Av+e|DJ, 

t 

where D { is the time average £ (Av + e + s k )/t,asan unbiased but perhaps inefficient estimator given D t . If as t goes to 

i 

infinity D converges almost surely to Av+e, then Cov(E(Av+e|Dt, v)/Cov(Av+e,v) converges to 1 as t goes to infinity. Since 
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A simple example may be helpful. If e,is iid with variance o € 2 , then 0, has the familiar form 
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In this case, 0, is strictly increasing in t because the independence among the e, means that each e, has 
some new information about 0 t . 0, is 0 when t is 0 and converges to 1 as t goes to infinity. 

There are two conclusions, which we summarize in Proposition 1 and 2: 

Proposition h Under the assumptions of the above model, the regression coefficient b Z ( is nondecreasing 
in t. The regression coefficient b s t is nonincreasing in t." 

Proposition 2 : If firms have complete information about the productivity of new workers, then db s fidt = 
cb z t/dt = 0. 

These results underlie our empirical analysis below. Using AFQT and father's education as z 
variables, we examine the experience profile of b st and b*. The intuition for the decline in b s , is that as 
employers learn the productivity of workers, s will get less of the credit for an association with 
productivity that arises because s is correlated with z provided that z is included in the wage equation with 
a time dependent coefficient and can claim the credit. 12 We also are able to estimate the time profile of 0, 
up to scale. Under the assumption that employers learn about v and e at the same rate, this enables us to 
estimate the time profile of employer learning about productivity up to scale. In AP (1996) we examine 
the implications of our estimates for pure signaling models of the return to education. 

The model also implies a third result, which we state in proposition 3. 

Proposition 3 : Under the assumptions of the above model, db s fdt = -& zs ob z fd. 

Since is simply the regression coefficient of z on s and can be estimated, the coefficient restriction in 
Proposition 3 may provide leverage in differentiating between the leaming/statistical discrimination model 
and alternative explanations for the behavior of b st and b*. 



E(Av+e|DJ is more efficient than E(Av+e|Dj|, E(Av+e|D,) must also converge almost surely to Av+e , which establishes that 
Cov(E(Av+e|DJ, v) / Cov(Av+e, v) converges to 1. We conclude that 0, converges almost surely to 1 as t becomes large. 

11 The coefficients on an unfavorable z characteristic, such as criminal involvement or alcohol use, will become more negative 
to the extent that these reflect permanent traits. Assuming s is negatively correlated with the unfavorable z, b„ will rise with t. 
As noted earlier, we have normalized z so that A > 0. 

12 It might be particularly interesting to see if the "diploma effect" declines with t while the coefficients on hard to observe 
productivity characteristics that correlate with getting a diploma rise. (See Frazis (1993) for a recent analysis of whether there is 
a diploma effect.) 
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Additional Empirical Implications 

As noted in footnote 4, the literature on statistical discrimination as well as the literature on labor 
market networks has emphasized differences across groups in the amount of information that is available to 
firms (or the mapping between a given set of data and what the firm actually knows) may differ across 
groups. Our model implies that these differences will lead to group differences in wage dynamics. To see 
this, suppose that there are two groups, 1 and 2. For group 2 the firm's initial information set is larger than 
for group one. Consequently, Var(Av + e | group 2) < Var(Av + e | group 1) and cov(Av + e, v | group 2) < 
cov(Av + e, v | group 1). From equation (10') or (11), it follows that b st and b* vary less over time for 
group 2 than group 1 . In the extreme case, when firms are fully informed about group 2, cov(Av + e, v | 
group 2) is 0 and b st and b a are constant. In future work, it would be interesting to use this implication as a 
way of testing the hypothesis that the quality of information that employers have differs across labor force 
groups. Theories that stress differences in the ability of employers to evaluate the performance of 
members of different groups imply different amounts of noise (from the point of view of the employer) in 
the signals d, and different paths of 0 t . 

In standard labor data sets based on household interviews information on y ; is not available. 
However, it is interesting for at least two reasons to discuss the cross equation restrictions between the 
equation relating y, to s and z that are implied by the model. First, data could be gathered from both firms 
and workers. Second, information on y { or indicators of y ; may be available for use in other applications of 
our methods to study statistical discrimination. For example, in the study of mortgage lending, panel data 
on households might provide data on both credit records (related to y ( ), success in loan applications (the 
counterpart to w it ), and hard to observe background variables (such as the income and wealth of relatives). 
Suppose that one has a measure y it * that is equal to y { plus noise Assume that <^ it is independent of all 
other variables in the model. Then the model implies that 

(13a) y it ’ = (b s0 + $ s ) S; + (b^ + Ojz; + error 

where the error term is orthogonal to s and z. Note that the coefficients are time invariant. This equation 
and (9) are heavily overidentified. By estimating the equations jointly one can identify 0, separately from 
<b s and O z . The availability of a productivity indicator would be particularly useful when one relaxes the 
assumption that the effect of s and z on y is time invariant. 
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Case 2: z Is a vector 



(14) 



We now consider the case in which z is a vector z - In this case, 



bso 





r+A^ 2 + ct 2 
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Oqs 


.bzo. 




0. 




Oqz 



where [<£ qs , <£ qz ]' is the 1 x (K+l) vector of coefficients from the auxiliary regression of (a, + Ay,)q on 
s and z. 

In the vector case [b st , bj are: 



(15) 
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_b zt _ 



bso 

bzo. 



+ var(s,z) 1 
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co v(z, E(A v+e|D t j 



where var(s,z) is a K+l x K+l matrix, cov(z, E(Av+e|D,)) is a K element vector and we have used the 
fact that cov(s, E(Av+e|D,))=0. 

Let G k be the k* row of the K x K matrix G = [vai(s)vai(z)-cov(z,s)cov(z,s)']~' 
where var(s) is the variance of the scalar s, var(z) is the variance of the vector z, and cov(z,s) is the 
vector of covariances between s and the elements of z. In appendix 1 we show that b st and b zkt can be 
expressed as 

n 

(16) b„ = bso - ^(cov(s, Zk) Gk ■ 0« • [cov(z , A v+ e j]) 

k~! 



(17) b Zk , = b Zk o+var(s)G k -0kk[cov(z,Av+e)] 

where = cov(E(Av+e|D,), z k )/cov(Av+e, zj. Let 0 ZtJ be the coefficient of the regression of z k on 
s, k=l,. . .,K. Equations using (16) and (17) lead to Proposition 4, which is the vector analog to 
Proposition 3. 



Proposition 4: When z is a vector and the assumptions of the above model hold, 
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Proposition 2 generalizes to the vector case. Proposition 1 does not. With multiple z variables, 
one cannot in general sign db st /dt and d>^ t / d even if all the elements of A are positive, each element 



of cov(z, Av + e) is positive, and 0 2jJ is positive and ©i* is increasing in t for all k. However, from 
Proposition 4, it follows immediately that if do^ x Id > 0 and b Zkt > 0 for all z* used in the analysis 
then dbjdt is < 0. We can verify the conditions for a particular s and set of z variables. 13 

If the ©« are the same for each of the z k and equal to the common value 0 t , then the time paths 
of b st and the elements of b^ will all be proportional to 0, : 



(17a) 



j0 



J z0 _ 



+ 0 . 



-<D <D 
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The ©« will be the same for all k if the following two conditions on the conditional distribution f of 
D t and the conditional mean of z k hold: 

Condition 1. f(D„ |z k ,Av + e) = f(D,|Av + e) for all k ; 



Condition 2. E(z k | Av + e) = (J) k R(Av + e) where 4> k is a scalar that is specific to k and R is some 
function. 

Basically, these conditions imply that the distribution of the signal D, is driven by e + Av and 
that the signal D t is not more informative about particular elements of v than others. 14 The condition 



13 Initially we were surprised that the conditions that A > 0, cov(z, Av + e ) > 0, and ©^ is increasing in t for all k do not 
guarantee that b n is positive even if <h 2ks > 0 for all k. The intuition is as follows. The OLS estimator of is equivalent to 
regressing the wage in period t on the residuals z k from the regression of Zj. on s. z k is the sum of v k plus the component of the 
kth element of Y,q that is orthogonal to s. The components Yiq that are orthogonal to s are unrelated to v k and e but are likely 
to be correlated across z k . Consequently, using OLS to estimate b^, is analogous to applying OLS in a situation in which 
several of the regressors are measured with error, and the measurement errors are correlated. (The z k may be thought of as 
noisy measures of v k .) It is possible in such a situation for the probability limit of the OLS estimator to take on the wrong sign. 



14 To establish the conditions, note first that 

Cov((E(Av +e|Di),Zk)) = { { JzkE(Av+e|D 1 )g(D l ,Zk|Av k )h(v) 

Av+ezkDt 

= J J Jz k E(Av+e|Di) g(D.|Av) f(zk|Av) h(Av) 

Av+eztDi 

= | jE(zkl Av) * E(E(Av +e|Di)| Av) • h(Av) 

Av+ezt 

and that 

Cov(Av +e,z0 = J E(zk|Av+e)-(Av+e)*h(Av+e). It is easy to verify from these equations that Cov(E(Av + e|Dt, 

Ay +e 

zJ/CovfZklAv + e) is the same for all z,. if (EfzJAv +e) =4) k v P(Av + e) . 
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will hold if, for example, d, is generated by (5) and e and the elements of v are normally distributed. 

The conditions rule out the possibility that the range of a particular element of v, say v k , is either 
-100,000, 0, or 100,000. In this case, a very small or very large value of D t would be very informative 
about v k . 

These conditions are quite strong. For example, if the firm obtains indicators about 
subcomponents of v and e as well as y, then it is likely to learn about some components of productivity 
faster than others. In this case, equation (16) and (17) continue to hold, but the time path of the 
education slope is a weighted average of the 0** that determine the time paths of the individual 
The paths of the individual z k will reflect differences across z^n the rate at which firms learn about the 
productivity components that they are correlated with. This is an important result, because it states 
that differences in the effects of particular variables on wage growth may reflect differences in the rate 
at which firms learn about the variables. This provides an alternative or a complement to the standard 
view that the differential effects on growth rates reflect differences in the relationship between the 
variables and other sources of wage growth such as on-the-job training. 

Case 3: s and z are both vectors. 

Finally, we consider the case in which both s and z are vectors. In this case we reinterpret all of the 
of the related variables and parameters in the model, such as b s0 , b^, v, etc as vectors or matrices. The 
vectors of coefficients b s0 and b^ on s and z in the base year satisfy (14) where the vectors <& qs and <& g2 are 
the coefficients in the regression of q on s and z. The vectors b st and b zl are given by 



(17b) 




+ y4[cov(v,£'(Av + e|£),)] 



where 



A = 



- var(s)‘' cov(s,z)[var(z) - cov(z,s) var(.s) 1 cov(s,z)] 
[var(z) - cov(z,s) var(s)“' cov(s,z)] 
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Since -var(s)''cov(s,z) is the matrix of coefficients from the regression of z on s we obtain the vector 
version of proposition 4: 

1 ~t>s o = ®zs(b z , ~b z0 ) , 

where is redefined to be the matrix of coefficients of the regression of the vector z on the vector s. 
When conditions 1 and 2 are satisfied and the signal D t is not more informative about particular elements 
of v than others then (17b) reduces to (17a) with both b st and b n as vectors. . 

Statistical Discrimination on the Basis of Race 

Firms observe race. If race is correlated with productivity and firms violate the law and use 
race as information, then race has the properties of an s variable. To see the empirical implications of 
this, partition s into two variables, s, and s 2 , where s, is an indicator variable for membership in a 
particular racial group and 0 otherwise, and s 2 is schooling. 15 In this case, the model implies almost 
immediately that the coefficient on s, does not vary over time if the interaction between z and t is 
excluded from the model. If this interaction is included (17a) implies that the time paths of b slt and b s2t 
are 

b s , t =b sl0 - 0^,0, , 

bs2t~b S 20 ■ ^zs2®t 

where and are the coefficients on s, and s 2 in the regression of z on s, and s 2 . Assuming 
is negative, as it is when s, indicates that the person is black and z is AFQT, father’s education, or the 
wage of an older sibling, then the wage coefficient on s, will rise over time. 

In contrast, if firms obey the law and do not use race as information, then in the econometric 
model, race has the properties of a z variable. In the case in which race is the only z variable and one s 
variable, such as education, is included in the analysis, then the coefficient on z in equation (11) 
corresponds to the coefficient on race. The model implies that if (i) race is negatively related to 
productivity (A < 0), (ii) firms do not statistically discriminate on basis of race, and (iii) firms learn 
over time, then the race differential will widen as experience accumulates. The intuition is that with 

15 The element of r corresponding to the race indicator s, in the productivity equation (1) is 0 unless consumer or employee 
tastes for discrimination reduce profitability of employing members of the minority group, as in Becker (1971). (Even if r is 0 
race may be negatively related to productivity if it is correlated with elements of z, q, or q that affect productivity.) 
Presumably, firms that violate the law and discriminate in response to their own prejudice or the prejudice of consumers or 
other employees might also be willing to use race as information. Employers who harbor prejudice against certain groups may 
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learning firms are acquiring additional information about performance that may legitimately be used to 
differentiate among workers. If race is negatively related to productivity, then the new information 
will lead to a decline in wages. If education is negatively related to race, then the coefficient on 
education should fall over time. 

What happens if firms do not discriminate on the basis of race and one adds a second z variable 
with a time varying coefficient to a model that contains race and an s variable? Let z, denote race and 
Zj denote the additional variable, and let b zlt denote the coefficient on race when experience is t and Zj 
is included in the model and let b zll * denote the corresponding coefficient when Zj is excluded. 

Assume that ©{ , = © 22 = 0 X where 0« is defined below (17) above. In Appendix 3 we show that 

cb lU ia-a>J ia=-s>e,ia-[ o 2 o 2j2i ] 

where O z2 is the coefficient on z 2 in the regression of Av + e on s, z, and z 2 and 0 2;2i is the coefficient 
on z, in the regression of z 2 on z, and s. When z, indicates whether the person is black and Z 2 is AFQT, 
father’s education, or the wage of an older sibling, is negative. If these variables are positively 

related to productivity, with 0 2j > 0 then cb zU Id- ob zU ' / d > 0 . We conclude that if firms do not 

statistically discriminate on the basis of race and race is negatively related to productivity, then (1) the 
race gap will widen with experience and (2) adding a favorable z variable to the model will reduce the 
race difference in the experience profile. We wish to stress that other factors that influence race 
differences in experience profiles as well as other forms of discrimination will also influence the wage 
results. 

2.2 Incorporating On-the-Job Training Into the Model: 

The analysis so far assumes that the effects of z and s on the log of productivity do not depend 
on t. Human capital accumulation is included in the model through the H(t) function but is assumed to 
be “neutral” in the sense that it does not influence the time paths of the effects of s and z. 16 In the more 
general case, the time paths of z and s depend on other factors as well as learning. In this section we 
first consider the effect that such dependence would have on OLS estimates of the interactions between 

be especially unlikely to form beliefs about the productivity of those groups that are rational in the statistical sense used in this 
paper. 

16 One may easily modify the theoretical framework to allow for this form of human capital accumulation. For example, the 
H(t) function may reflect learning by doing in all jobs that is observable to firms, or worker financed investments in human 
capital that are observable to firms. 
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t and z and s. Then we discuss estimation of a more realistic model that includes both human capital 
accumulation and leaming/statistical discrimination. As we shall see, there is no clean way to sort out 
the relative roles of these two mechanisms without data on productivity. 

Suppose that s is complementary with learning by doing or enhances the productivity of 
investments in general skills. We return to the case of scalar z. Then the productivity equation (net of 
training costs) might take the form 

(18) y, = rs + r,st + H(t) + a I q+Az + r| . 

Assuming that the training activity is observed (firms know (18)) and workers pay for the 
general training, the wage equation (9) becomes 

(19) w t = (b st + r,t) s + b^z. + H(t) + a,q + E(Av + e|D,) 

Most discussions of human capital and most of the empirical evidence on employer provided 
training suggest that education makes workers more trainable and that educated workers receive more 
training. In this case r, will be greater than 0. 17 Probit models of the probability that a worker 
receives training in a given year show strong positive effects of schooling, and AFQT as well as 
smaller but positive, statistically significant effect of father’s education. (See below.) 

What are the implications of this for our investigation of the hypothesis that the reliance of 
employers on easily observable variables to estimate productivity declines over the career? In 
estimating the model we identify the sum b st +r,st rather than b st . If r, is greater than 0, then the 
estimated relationship between b st + r,st and t will be biased against the hypothesis that employers 
leam about productivity. As it turns out, we find a strong negative relationship between b st + r,st and t, 

« which is only consistent with a training interpretation if education reduces learning by doing, the 
productivity of training investments, and/or the quantity of training investments. 

There is also the possibility that the productivity of employer provided training and/or learning 
by doing depends on z and/or T). This case is harder to analyze because employers do not observe z 
and r| directly and are learning more about them as time goes on. As a start, we consider the extreme 



17 Earnings slopes depend on the expected productivity of the worker if the costs or returns to training depend on variables such 
as z or s. Altonji and Spletzer (1992) find a relationship between test scores and measures of tra inin g using the NLS72 data set, 
and many studies find a link between schooling and training measures. See for example, Bartel and Sicherman (1992) and 
Lynch (1992). 
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case in which firms are fully informed about z, so that 0 t is 1 and b rt in (9) is a constant in the absence 
of training. 

Suppose that the productivity equation is 
(20) y t = r s + r,s t + r 2 z t + H(t) + a,q +Az + T| , r 2 > 0 

If firm’s knowledge of s and q is fully informative about z, then the presence of r 2 in the productivity 
equation should lead the effect of z on the wage to rise with experience even if b rt does not depend on 
time ( 0 ,= 1 ). However, the presence of r 2 z t in the productivity equation seems unlikely to lead to a 
negative estimate of <3b st /<3t. 

It is important to point out, however, that if the effect of z on y rises with t then introducing the 
interaction between z and t into the wage equation could lower the estimate of the change over time in 
the wage response to s. Let B st be the expectation of the OLS estimator of the effect of s on the wage 
in period t. Then B st is b s0 + r,t + r 2 1, where is the coefficient of the regression of z on s. When 
one adds z t to the regression, B st becomes b s0 + r, t. B rt , the expectation of the OLS estimator of the 
effect of z in period t, becomes b* + r 2 1 . If > 0 and r 2 >0, then r 2 > 0. The change in the 
coefficient on s when z t is added is r 2 t. Consequently, in the scalar case the simple training 
model with full information about z implies that [dB st /dt] declines by - Ozs [dB^dt] when z t is added 
to the wage equation. 

In the pure employer leaming/statistical discrimination model dB st /dt is equal to db s / 8t and, 
according to proposition 3, the learning model also implies that dB st /dt declines by -O zs 5b zt = -O^dt 
when z t is added to the wage equation. However, models differ in their implications for the level of 
dBJdX after z t is added. A pure human capital model with perfect information implies dBJdt > 0 
unless, in contrast to the available evidence, s has a negative partial effect on the quantity or return to 
on-the-job training (r, < 0). 

Controlling for Training. 

In the absence of data on productivity, sorting out the relative importance of employer learning 
and non neutral (with respect to z and s) on-the-job training may require that one build a model of the 
quantity of training as a function of s and z and use a proxy based on the training model to control for 
the effects of non neutral general human capital accumulation in the wage equation. This raises a 
number of difficulties that we explore in the next few paragraphs. 
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We return to the case of scalar z. Assume that the productivity equation (net of training costs) 
takes the form 

(21) y t = r s + ¥(2^) - C(T jt ) + H(t) + a,q + q . 

where 2T iT = S T=l t T iT , ¥(•) is an increasing function that summarizes the effect of accumulated 
traimng on productivity, and C(T it ) is the cost in terms of the log of productivity of T it units of training 
in period t, and the function H(t) has been redefined to accommodate the inclusion of training. 

Assume that T it is determined by employer beliefs about productivity given D t , q, s, and t, as well as by 
D„ q, s, and experience. Then 

(22) T it = h(D„ q, s, t) = r(s,z,t) + u, 

where r(s,z,t) is E(h(D„ q, s, t)|s,z,t) and u, is an error term that is related to q and D t but is assumed 
independent of s, z, and t. Following through on a series of substitutions that parallels those leading 
to (8), and assuming that the worker pays for and receives the returns to the general training yields the 
wage equation 

(23) w t = (r + y 2 + a 2 )s + ¥(2 T jt ) - C(T it ) + H*(t) + A(y, + a,)q + E(Av+e |Dt) + q t 
Suppose that up to an irrelevant constant ^(St., , T it )) = V|/, S T .,„ t T it and C(T it ) = c, T it . 

Then the regression function relating w t to s,z, 2T iT , and T it in period t may be written as 

(24) w t = (r + a lt )s + (y.+a^S T iT + (a^-c,)^ + a 4t z + H*(t) + 

where a lt , a*, and a 4t are the coefficients of the linear least squares projection of A(y, + a,)q + 
E(Av+e |DJ onto s, 2 T it , T jt , and z, and the error term is unrelated to the variables in the model 
by definition of a lt a^, aj„ and a 4t . The time path of a lt and a 4t will be influenced by changes over time 
in the correlations of s, z, and A(Yi + cc,)q with 2 T iT and T it as well as changes over time in the 

correlations of z, 2 T iT , and T jt with E(Av+e |Dt). (The coefficients of the experience profile H*(t) 
will be influenced as well.) 

Two implications follow from (23) and (24). First, even if training depends only on 
information that is known to the firm at the start, the relationship between q and T it and 2T it may 
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change with t, leading to changes over time in the coefficient on s even if there is no learning. The 
second point follows from the fact that training depends on D t and so will be correlated with it. The 
least squares estimates of the coefficients on the training variables will reflect both the direct effect of 
training and a relationship between the time path of T it and E(Av+e |D,). As a result, the effect of 
adding the training variables to the model on b st and b* is complicated in a mixed human 
capital/employer learning model. In particular, one might expect the addition of functions of T it and 
ST it to the model to change and quite possibly reduce the rate of increase of b*, for two reasons. First, 
the training variables change over time and are positively correlated with z. Second, they will absorb 
part of the trend in E(Av+e |Dt), and it is changes in this term that induce the variation with t in b s , and 
b*. Furthermore, the introduction of the training terms alters the partial correlation between z and s, 
which changes the effect on the path of b st of introducing z with a time varying coefficient. 

Unfortunately, we have do not have a way to isolate the effects of training from the effects of 
statistical discrimination with learning if, as seems plausible, the quantity of training is influenced by 
the employer beliefs about productivity. Consider the null hypothesis that (1) learning is important, (2) 
variation with s and z in the rate of skill accumulation is not, and (3) variation in our measure of 
training is driven by worker performance (which leads to promotion into jobs that offer training) 
rather than by exogenous differences in the level of human capital investment. Even under this 
hypothesis one would expect the introduction of the training measures to lead to a reduction in the 
growth over time in the coefficient on z and a reduction in the impact of z on the time path of the 
coefficient on s. With an indicator of y it , that problem is easily solved, but we lack such an indicator. 

Despite the absence of a clear structural interpretation of the results we think it is important in 
this initial study to see how introducing measures of training alters b st and b a . Consequently, below 
we report estimates of (24). There are two additional problems in using the training data. First, the 
measure T* jt of T it is almost certain to contain measurement error. Second, the quality of the training 
data prior to 1988 is too poor to be used, which means that the data needed to form the measure ST it is 
missing for persons who left school prior to that year. We do not have a solution for the first problem 
but deal with the latter problem by estimating a flexible model relating T* it to s, z, and t using data from 
1988-1993 and using the model to impute values in the earlier years. 18 We estimate variants of (24) 



18 Spletzer and Lowenstein (1996) provide means of dealing with measurement error in the training data but these are beyond 



the scope of our study. 
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below. Our preferred specification is a wage growth model based on the first difference of (24). The 
growth specification has the advantage of only requiring data on T it and T it .,. Perhaps more 
importantly, this specification also eliminates bias from unobserved person specific effects that are 
known to firms and are correlated with both training and wages. 

3. Data 

The empirical analysis is based on the 1992 release of NLSY. The NLSY is a panel study of 
men and women who were aged 14-22 in 19J8. Sample members have been surveyed annually since 
1979. (In 1994 the NLSY moved to a biannual survey schedule.) The NLSY is an attractive data set 
for the study of employer learning and statistical discrimination. First, the sample sizes are large. 
Second, sample members are observed at or near the start of their work careers and are followed for 
several years. Third, the NLSY contains detailed employment histories, including reasons for job 
changes. Fourth, it contains a rich set of personal characteristics that may be related to productivity 
and may be hard for employers to observe, including father and mother's education and occupation, 
drug and alcohol use, criminal activity, AFQT, aspirations and motivation, and performance in school. 

Furthermore, the data set contains a large number of siblings. The earnings of older siblings as well 
as parents may be used as indicators of characteristics of younger siblings that affect productivity but 
are hard for employers to observe. Finally, it contains measures of training, which we need to 
investigate the possibility that variation with experience in the effects of schooling and our measures 
of hard to observe personal characteristics are due to a relationship between these variables and the 
quantity of training received. 

We restrict the analysis to men who are white or black who have completed 8 or more years of 
education. We exclude labor market observations prior to the first time that a person leaves school and 
accumulate experience from that point. When we analyze wage changes, we further restrict the sample 
to persons who do not change education between successive years. Actual experience is the number of 
weeks in which the person worked more than 30 hours divided by 50. Potential experience is defined 
as age minus years of schooling minus 6. To reduce the influence of outliers, father's education 
(F_ED) is set to 4 if father's education is reported to be less than 4. AFQT is standardized by age. 19 



19 The age of the sample members at the time the AFQT was administered varies somewhat in the NLSY sample. This 
induces some variation in schooling levels at the time the AFQT is taken. To calculate standardized AFQT, we adjust the raw 
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The means, standard deviations, minimum and maximums of the variables used in analysis are 
provided in Table A1 in the Appendix , along with the variable definitions. The mean of actual 
experience is 4.9. The mean of potential experience is 7.3, and the mean of education is 12.7. All 
statistics in the paper are unweighted. Blacks are over sampled in the NLSY and contribute 28.8 
percent of our observations. Table A2 reports correlation coefficients and simple regression 
coefficients that summarize the relationships among the key variables used in the analysis. 

4. Results for Education 

In Table 1-3 we report estimates of our basic wage level specification. In table 1 we use 
potential experience as the experience measure and use OLS to estimate the model. The equations also 
control for a cubic in experience, a quadratic time trend, residence in an urban area, and dummy 
variables for whether father's education is missing and whether AFQT is missing. We add interactions 
between the dummy variables for missing data and experience when interactions between father's 
education and experience and AFQT and experience are added to the model. These variables are not 
reported in the tables. Standard errors are White/Huber standard errors computed accounting for the 
fact that there are multiple observations for each worker. 

In column 4 we present an equation that includes s, Black, and s x t. This corresponds to (7a) 
with b st restricted to b st = b s0 + b sl x t. The coefficient on s x t/10 is -.0077 (.0062), suggesting that the 
effect of education on wages declines slightly with experience. In column 5 we add AFQT and F_ED, 
where F ED is years of father's education. As had been well documented, AFQT has a powerful 
association with earnings even after controlling for education. A shift in AFQT from one standard 
deviation below the mean to one standard deviation above is associated with an increase in the log 
wage of .157. The coefficient on education declines to .080 and b sl becomes more negative. 

In column 6 we add linear interactions between t and two different z variables, AFQT and 
F_ED, to the equation. The resulting equation corresponds to (9) with the restriction that b st = b s0 + 
b sl x t and b zt = b^ + b zl x t, except that we introduce two z variables rather than 1 . The estimates imply 



AFQT score by subtracting the mean score for each age and dividing by the standard deviation for that age. For individuals 
with siblings in the sample, the coefficients of the regression of the unadjusted test score of the older sibling on the test score of 
the younger sibling and the regression of the test score of the younger sibling on the the score of the older sibling are very 
similar after one also controls for age, suggesting that the information in the test is not very sensitive to age over the range in 
the sample. 
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that the effect of AFQT on the wage increases greatly with experience t. h AF Q T |, which is the 
coefficient on AFQTxt/10, i s .0820 (.0125). b A p QTt , which is dwt/dAFQT, rises from only .0179 when 
experience is 0 to .0999 when experience is 10. The results imply that when experience is 10 and 
education is held constant, persons with AFQT scores one standard deviation above the mean have a 
log wage that is .200 larger than persons with AFQT scores one standard deviation below the mean, 
while the difference is only .036 when experience is 0. The effect of father's education also increases 
with experience. The main effect is actually slightly negative (but not significant). However, the 
interaction term is positive, though not statistically significant. 

Our results for AFQT and F ED are consistent with Farber and Gibbon's results in which they 
use the components of AFQT and an indicator for whether the family had a library card when the 
person is 14 that are orthogonal to the wage on the first job and education. The key result in the table 
is that the coefficient on s*t/10 declines sharply (to -.0351 (.0069)) when AFQT*t and F_ED*t are 
added. The implied effect of an extra year of education for a person with 10 years of experience is 
only .0633. Strikingly, the coefficient on s rises to .0984 which is almost exactly what we obtain when 
we exclude all terms involving F ED and AFQT from the model (columns 1 and 4). 

These results provide support for the hypothesis that employers have limited information about 
the productivity of labor force entrants and statistically discriminate on the basis of education. Early 
wages are based on expected productivity conditional on easily observable variables such as education. 
As experience accumulates, wages become more strongly related to variables that are likely to be 
correlated with productivity but hard for the employer to observe directly. When we condition the 
experience profile of earnings on both easy to observe variables, such as education, and hard to 
observe variables, such as AFQT and father's education, we find the partial effect of the easy to 
observe variables declines substantially with experience. While one might argue that the positive 
coefficients on AFQT*t and F_ED*t are due to an association between these variables and training 
intensity, it is hard to reconcile this view with the negative coefficient on s*t. While measurement 
error in schooling may enhance the effect of F ED and AFQT and may partially explain the decline in 
s between columns 1 and 3, it does not provide a simple explanation for the behavior of the interaction 
terms with experience. 

In Table 2 we present OLS results using actual experience in place of potential experience as 
the experience measure t. The main difference between this table and table 1 is that the return to 
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education is lower and the s*t interaction is positive and fairly large in the equations that exclude 
AFQT x t and F_ED*t. However, the coefficient on sxt/10 declines from .0200 in column 5 to -.0056 
when the interaction terms are added in column 6 of Table 2. This decline is similar to the decline that 
we obtain in column 3. 

The results in Table 2 are difficult to interpret, because the intensity of work experience may be 
conveying information to employers about worker quality. It is an outcome measure itself. The 
implications of employer learning for the wage equation are changed if one conditions on information 
that becomes available to employers as the worker's career unfolds and may reflect the productivity of 
the worker. Conditioning on actual work experience raises some of the issues that would arise if we 
conditioned on wages in t-1 or on training received. On the other hand, the results based on potential 
experience are likely to be biased by the fact that potential experience mismeasures actual. For this 
reason, in Table 3 we report the results of re-estimating the models by instrumental variables (IV), 
treating all terms involving actual experience as endogenous with corresponding terms involving 
potential experience as the instruments. The results in columns 5 and 6 of Table 3 are basically 
consistent with those in Table 1 . The coefficient on AFQT is .0177 (.0096) and the coefficient on 
AFQT t/10 is .1148 (.0164). These estimates imply that conditional on years of schooling, AFQT has 
only a small effect on initial wages, but when t is 10 a two standard deviation shift in AFQT is 
associated with a wage differential of .247. The coefficient on s t/10 declines from -.0181 when the 
interactions are excluded in column 5 to -.0561 in column 6. 

Controlling for Secular change in the Return to Education 

In column 9 of Tables 1, 2, and 3 we add the interaction between s and calendar time to the 
model containing father’s education and AFQT. 20 In the case of potential experience in Table 1, the 
education slope is reduced by .02 per year, and the interaction between education and experience/10 
drops to -.051, but otherwise the results change little. In column 10 we add the interactions between 
calendar time and s, F ED, and AFQT to the model containing the interactions between t and all three 
variables. In column 10 the interactions between F_ED and AFQT and calendar time have positive 

20 Murphy and Welch (1992), Katz and Murphy (1992), Taber (1996) and Chay and Lee (1997) are among a large number of 
recent study of changes in the structure of wages in the U.S.. Since calendar time is positively correlated with experience t in a 
panel data set, the leaming/statistical discrimination model implies that estimates of secular changes in the return to education 
and AFQT will be biased in opposite directions if one fails to add the interaction between these variables and t to the model. 




O' 



? 



23 



coefficients, indicating that the effects of these variables rose during the 1980s. Adding the time 
interactions reduces the size of the experience interactions with F_ED and AFQT, but the qualitative 
pattern of the results does not change. 

Controlling for Occupation 

One objection to the theoretical framework underlying the estimates in Tables 1-3 is that it 
assumes that the flow of information to employers is independent of the type of job the worker begins 
in. This is contrary to the idea that some jobs are "dead end" jobs. Perhaps education (and high 
AFQT) enables a worker to gain access to jobs in which firms have the ability to observe whether the 
worker has higher level skills that are strongly related to productivity. As a simple check on this 
possibility, we present a series of equations in Table 4 that control for the 2-digit occupation of the 
first job. The results are very similar to what we obtain when occupation is excluded. 21 . 



The Effects of the Wage of a Sibling 

In Table 5, we use the wages of siblings with 5 to 8 years of experience as a hard to observe 
background characteristic. The coefficient on sxt/10 is -.0097 (.0089) in column 4, which includes the 
log of the wage of the oldest sibling. The learning model does not provide an explanation for the 
negative interaction term, nor does the conventional view of how education is related to on-the-job 
training. However, when we add the interaction between the sibling wage and t in column 5, the 
coefficient on the education interaction falls to -.0146, and the coefficient on the interaction between 
the sibling wage and t/10 is .086 (.0327). 22 The effect of the sibling wage rises from .127 upon labor 
force entry to .213 after 10 years of experience — a very large increase. The point estimate of the 
interaction between education and experience result is essentially unchanged when we allow the effect 
of sibling wage. In Table 5, columns 5 and 6, we show that these results are robust to allowing the 
effects of education and the sibling wage to depend on calendar time. Our interpretation of these 
results begins with the premise that the labor market productivity of siblings are correlated. As a 



21 An interesting project for future research would be to use information from the Dictionary of Occupational Titles on skill 
requirements of occupations and trace how easy to observe and hard to observe productivity characteristics are related to 
changes over a career in the skill requirements of the job a worker holds. It would also be interesting to examine how the 
slopes are influenced by the skill requirements of the initial occupation held by the individual. 

22 The corresponding point estimates are -.022 and .080 when we allow the effects of education and the sibling wage to depend 
on calendar time. 
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worker acquires experience this correlation is reflected in the performance record D t and in wage rates. 
The sibling wage is correlated with education, and so the effect of education on the wage declines with 
experience because firms are estimating productivity with a bigger information set than at the time of 
labor force entry. 23 



The Experience Profile of the Effects of AFQT and Education on Wages 

In this section we take a more detailed look at how the effects of AFQT and s vary with 
experience by estimating models of the form 

w t = f(z,t;bj + h(s,t;b s ) + H(t) + e it 

where b z and b s are now vectors of parameters. Table 6 is based on models in which f(z,t;b z ) and 
h(s,t;b s ) are quartic polynomials in t. In the top panel, the experience measure is potential experience; 
in the bottom panel we use actual experience instrumented by potential experience. All of the models 
in the tables contain the other control variables discussed above. They also include F_ED and 
F_EDxt. 

The columns report <5w/3AFQT, dV/dAFQT, dt dw/ds, and dV/dsdt at various experience 
levels. The first column of the table shows that dw/dAFQT increases steadily from .0197 when t is 0 
to .121 when t is 12. (We only go out to t=12 because sample information becomes thin at higher 
values.) The specification that we use in most of the paper, in which f(z,t;b z ) and h(s,t;b s ) are linear in 
t (column 6 in tables 1-3), suggests an increase in dw/dAFQT from .0179 to .1 16 as t goes from 0 to 
12 . 

As noted earlier, employer learning implies that dw/dAFQT is nondecreasing in t ( i.e., 
d 2 w/dAFQT,dt >0), with a strict inequality likely if some new information arrives each period on y. If 
the noise in observations of y t are iid, then the rate of increase d 2 w/dAFQT,dt should decline with t, as 



23 Farber and Gibbons (1996) use men and women, include Hispanics, and restrict their sample to persons who have worked at 
least three consecutive years since attending school. Using this sample the coefficients on AFQT*t and the effect on s*t of 
adding AFQT*t are similar to those reported above. We also obtain qualitatively similar results when we follow Farber and 
Gibbons and use the level of wages rather than the log. We experimented with an indicator for whether any person in the 
respondent's household had a library card at the time the respondent was 14, a variable which Farber and Gibbons also used. 
We confirm Farber and Gibbons' finding that the coefficient on the residual from a regression of this variable on the initial real 
wage, education, part-time status, an interaction between education and part-time status, race, sex, age, and calendar year 
increases with experience, as well as their finding that the results for library card and AFQT are weakened substantially when 
these variables are interacted with calendar time. However, when we use the library card variable itself the effect of the library 




25 



23 



shown in expression (12c) for 0, above. The rate of increase must decline eventually because the 
amount of additional information in observations of labor market performance is declining. (0 t is 
bounded at 1.) However, it is possible that the first two or three observations on a worker are 
particularly noisy because of factors that we have left out of the model. For example job specific or 
occupation specific match quality may be more variable for new workers than more experienced ones. 

In column 2 we report <3 2 w/dAFQT,dt for various experience levels. The values increase from 
.0025 when t is 0 to .0104 when t is 5, remains at about this level until t is 8 (the maximum is .0108 at 
t = 6.5) and then decline to .0048 when t is 12. These results are reasonably consistent with a decline 
in the amount of new information with experience after an initial period of noisy observations. 24 

In panel B we replace potential experience with actual experience, and treat actual experience 
as endogenous. The 99th percentile value for this variable is only 13.33, so there not much sample 
information on t beyond this point. In column 1 we see that the effect of AFQT increases with 
experience. The rate of increase <3 2 w/<3AFQT,dt rises at first from .0092 when t = 0 to .0138 when 
t=5, but declines to -.0012 when t = 12. However, the standard errors on these derivatives are quite 
large. These results are also loosely consistent with the proposition that the rate at which new 
information about initial productivity arrives declines with experience, but the estimates are not 
sufficiently precise to say much about this. As the NLSY sample ages, it will be interesting to revisit 
the issue. 

In the model with potential experience, the return to education increases slightly between t=0 to 
t=3, and then declines sharply. In the model with actual experience, the decline is constant throughout 
from .0881 at no experience to .0299 at 12 years of experience. 

Testing the restrictions on the experience profiles of the effects of s and z on the wage. 

It is interesting to see how well the experience profiles of the education and AFQT coefficients 
satisfy the restrictions in propositions 3 and 4. One complication in performing these tests is the place of 

card variable falls rather than rises with experience. We thank Henry Farber for assisting us in reconstructing the Farber and 
Gibbons sample. 

24 We used two other non-linear specifications. The first used spline functions with break points at t=2, t=4, t=7,and t=10. In 
the second we restricted f(z,t;bj so that dV/dAFQT dt = 0 when t is 25 and h(s,t;b s ) so that dV/ds dt = 0 when t is 25. The 
idea is that the information about productivity that is contained in AFQT is fully revealed by the time t is 25. Both of these 
specifications yielded results similar to the reported model in which dV/dAFQT dt is flat or increasing and then definitively 
decreasing after about 7 years. 
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race within our model — should we treat race an s variable or a z variable? The answer to this question 
hinges on the extent to which employers violate the law and use race as an indicator of productivity. We 
discuss this at length in section 5 below. For now we will side step the issue by running separate tests on 
the white and black samples. Consider first a specification in which s and z are both scalars, education 
and AFQT score, respectively. Proposition 3 says that the product of -cov(s,z)/var(s) — the negative of 
the coefficient of the regression of z on s — times the coefficient on the interaction between AFQT and 
experience (z*t) should equal the coefficient on the interaction between education and experience (sxt). In 
the white sample, the product is -.00162 and the coefficient on sxt is -.00232. A Wald test does not reject 
the proposition. In the black sample the corresponding numbers are -.00196 and -.00498 and the 
proposition is rejected. 25 

We might also want to test whether the entire profile of the interactions between s and t and 
between z and t are in accordance with proposition 3. One way to do this is to estimate the model in 
which the interactions are specified as fourth-order polynomials and jointly test whether the 
coefficients on the four polynomial interactions are related by the coefficient of the regression of z on 
s. This seems a bit restrictive in that we only expect the relationship to hold over the range of 
observed data and polynomials that have very different coefficients can be fairly similar over a short 
range. However, we performed these tests on models in which the interactions of AFQT and 
education with experience are modeled as fourth order polynomials. Once again, we fail to reject the 
proposition for whites but reject for blacks. 

We also tested proposition 4, the vector analog of proposition 3, on models which include both 
AFQT and father’s education. We also considered as z variables the dummy variables indicating 
whether these quantities were known. This test amounts to a t-test of whether sum of the products of 
-cov(s,z)/var(s) and the coefficient on zxt for each z variable is equal to the coefficient on sxt. For 
whites, the sum of the products equals -.00193, the coefficient on sxt is -.00254, and the proposition is 
not rejected. For blacks, we obtain -.00166 and -.00456 and reject the proposition. 

Wage Growth Equations 

In Table 7 we estimate (9) in first difference form. We restrict b st to be b s0 + b sl t and b n to be 
bzo + b 2l t. The usual reason for working in first differences is to eliminate correlation between the 

25 It should be noted that the standard errors for these tests do not account for possible heteroscedasticity in the data. 
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regressors and a fixed error component. This motivation is not compelling in the present case. 
However, it is possible that the first difference specification may be less sensitive to errors in 
identifying when individuals start their careers. 

Columns 1-4 report OLS estimates with potential experience. The coefficient on the s*At will 
pick up the effects of secular changes in the return to education as well as the changes with experience 
in the return to education. The upward secular trend in the return to schooling may partially explain 
the fact that the s*At has a positive coefficient in the basic model in column 1 while it is negative for 
the corresponding level specification in Table 1, column 4. 26 (A secular trend in the return to education 
or AFQT matters less when estimating the equations in levels because much of the variation in 
experience is across persons of different ages). Also, the estimates are much less precise when we 
estimate in first difference form. However, the key results are qualitatively similar to the level 
specifications. In particular, the coefficient on s/10 declines from .0148 (.0094) in column 1 to -.0092 
(.0110) when we add the AFQT and F_ED interaction terms in column 2. The size of the decline in 
this coefficient is very similar to the drop in the coefficient on s*t when we add AFQT*t and F_ED*t 
to the level specifications. (See columns 5 and 6 of Table 1). The AFQT interaction term is positive 
with a t value of 3.4. The F_ED interaction is also positive and similar in magnitude to the result 
obtained in levels, but it is not statistically significant. 

Columns 5-8 reports IV estimates of wage growth equations using actual experience as the 
experience measure. The coefficient on AFQTxAt/10 is .0905 (.0197), which compares to the value of 
.1148 in Table 3, column 5. The coefficient on s x At/10 declines from .0295 (.0079) to -.0030 (.0100) 
when AFQT* At/10 and F_ED*At/10 are added. 

5. Do Employers Statistically Discriminate on the Basis of Race? 

Thus far we have focused the discussion on employers' use of education as an indicator of labor 
market productivity. In this section we examine the role of race. By almost any measure, young black 
men are disadvantaged relative to whites in the U.S.. On average, black males have poorer, less 
educated parents, are more likely to grow up in a single parent household, live in more troubled 
neighborhoods, attend schools with fewer resources, and have fewer opportunities for teenage 



26 See Murphy and Welch (1991) and many subsequent studies. Mumane et al (1995) provide evidence of an increase in the 
return to aptitude and achievement, as measured by tests. 



employment than white males. Many of these factors are correlated with educational attainment and 
labor market success. They are likely to lead to a black/white differential in the average skills of 
young workers. Discrimination in various forms may further hinder the development of human capital 
in black children, and add to a gap in skills that is due to the race difference in socioeconomic 
background. The gap in some indicators of skill are very large. In our sample, the mean percentile 
score on the AFQT for the black sample is 23.78 while the mean for whites is 53.27. Neal and 
Johnson (1996) and a number of earlier papers have shown that in the NLSY sample of men a 
substantial part of the race gap in wages is associated with the race gap in AFQT. 

If pre-market discrimination is an important factor in a gap between the average skills of black 
and white workers, then it seems likely that various forms of current labor market discrimination 
contribute to race differences in wages that are unrelated to skill. However, it is nevertheless 
interesting to examine the possibility that a correlation between race and skill might lead a rational, 
profit maximizing employer to use race as a cheap source of information about skills and statistically 
discriminate on the basis of race. Such statistical discrimination along racial lines can have very 
negative social consequences and is against the law. However, such discrimination would be difficult 
to detect. 

A statistically discriminating firm might use race, along with education and other information 
to predict the productivity of new workers. With time, the productivity of the worker would become 
apparent and compensation would be based on the larger information available rather than the limited 
information available at the time of hire. Consequently, if statistical discrimination on the basis of 
race is important, then adding interactions between t and z variables such as AFQT and father's 
education to the wage equations should lead to a positive (or less negative) coefficient on black*t and 
should lead to an increase in the race intercept. As noted in section 2, if firms use race as information 
then it behaves as an s variable in the model and the logic is the same as in our analysis of the effect of 
education. On the other hand, if firms do not use or only partially use race as information, then a race 
indicator behaves as a z variable. As discussed in Section 2, in this case the race gap should widen 
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with experience if race is negatively related to productivity, and adding a second z variable that is 
negatively related to race will reduce the race gap in experience slopes. 27 

The race differential in our basic specification in column 1 of Table 1 is -.1801. This drops to 
-.0969 when AFQT, F_ED, and education *t are added to the equation (column 5). When Black*t/10 is 
added in column 6, it enters with a coefficient of -.1456 (.0216). This coefficient is consistent with the 
hypothesis of no or very limited statistical discrimination on the basis of race and inconsistent with the 
hypothesis that firms make full use of race as information. The coefficient on Black is insignificantly 
different from 0, although the models do not provide a clear prediction about the sign of this variable, 
since race may be correlated with information in q that can legally be used. The fact that coefficient on 
Black x t/10 rises to -.0816 when F_ED*t and AFQT*t are added to the equation (column 8) is not 
informative about whether or not firms make full use of race as information. 28 

We obtain similar results using alternative experience measures in Tables 2 and 3. In Table 4, 
columns 7 and 8 we obtain similar results after controlling for initial 2-digit occupation. We obtain 
similar results using growth equations in Table 7, which should be robust to the presence of an economy 
wide time trend affecting the return to education, race, and AFQT. However, in the level equations we 
find that the results for race are sensitive to treatment of economy wide time trends. When we use 
potential experience as the measure of t the coefficient on Black*t declines only slightly ( from -.0146 to - 
.0144) when we adding time trend interactions involving race and AFQT to the wage level equation 
corresponding to Table 1, column 7, but the race-experience interaction no longer drops when AFQT and 
experience is added. (Not reported.) 

We wish to stress that the simple model of statistical discrimination cannot explain the negative 
coefficient on Blackxt unless firms do not make full use of race as information. The accumulation of 
additional information during a career that can legally be used to differentiate among workers is fully 
consistent with our results. However, there are several other explanations of the race differences in the 

27 The learning model in section 2 implies that differences across groups in the association between s and the z variable will 
lead to group differences in the b n and b a coefficients. We have not explored this empirically. An obstacle to doing so is that 
the results might be sensitive to the linearity assumptions that we have made. 

28 Japanese and Chinese Americans score higher on aptitude and achievement tests than whites. Our analysis predicts that if 
firms statistically discriminate on the basis of race and ethnic background then the addition of AFQT and AFQT*t to an 
equation containing a dummy and experience interaction term for these groups will lead to an increase in the dummy variable 
and a reduction in the experience interaction. Sample sizes do not permit an analysis of these groups. While one could 
differentiate among whites based on ethnicity (see Botjas (1992), it is not clear that these ethnic differences are observable to 
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experience slope in the literature that may be at work here. It is also important to point out that the results 
for Black and Black x t alone (i.e., ignoring the behavior coefficients of the coefficients on education and 
education xt) are potentially consistent with a story in which firms are fully informed, AFQT is positively 
associated with on-the-job training, and the race difference in AFQT is partially responsible for a race 
differential in wage growth. Adding AFQTxt would reduce a negative bias in Black x t associated with 
differential training levels. The increase in Blackxt when AFQTxt is added to the model would lead to a 
fall in the coefficient on Black. As we report below, we obtain qualitatively similar results when we add 
controls for employer training, but these controls reduce the magnitude of the coefficient on Blackxt and 
the effect of adding AFQTxt on the coefficient on Blackxt. 

Another potential test of whether race is used to statistically discriminate or not is to see 
whether proposition 4 holds either when race is treated as an s variable or when it is treated as a z 
variable. To do this, we use the model in column 8 of table 1. With race treated as an s variable, we 
regress the z variables (AFQT, father’s education, and the dummies for not knowing these quantities) 
on the two s variables. We sum the product of these coefficients and the coefficients on the z x t 
interactions in the main regression and compare them to the coefficients on the sxt interactions. We 
can then conduct a joint test of whether these two quantities are equal. For the education interaction 
the sum of the products equals -.00183 while the model coefficient is -.00301. For the race interaction, 
the two terms have opposite signs; the sum is .00644 while the model coefficient is -.00816. Not 
surprisingly, the proposition is soundly rejected. 

When we treat race as a z variable, we begin our test by regressing the 5 z variables on 
education, our s variable. Here, we have only one restriction to test. The sum of the products equals 
-.00215 while the model coefficient equals -.00301. The proposition can be rejected at conventional 
levels of significance (the P-value is .027) but with corrected standard errors this will probably not be 
the case. This is a further indication that employers are not treating race as information, or at least not 
fully. 



6. Models with Training 

In Table 8 we report estimates of equation (24) along with models that exclude the training 
variables. In these models we have excluded father's education. In the basic model in column 1 the 



employers. Our methods could be used to investigate statistical discrimination on the basis of attending prestigous colleges or 
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coefficient on sxt/10 is -.0102. In column 2 we add T t and 2T it to the equation. The variable T t has 
the expected negative sign of -.1044 (.0179), while ST it has a coefficient of .1864 (.01 14). The 
coefficient on s*t/10 falls to -.0346. The coefficient on AFQT falls from .0828 to .0582 while the 
coefficient on education rises slightly. The substantial negative experience slope on education might 
be consistent with a human capital story in which knowledge obtained in school depreciates over time 
unless one receives training. In column (3) AFQT*t/10 enters with a coefficient of .0502 (.0125), and 
the coefficient on and s*t/10 drops from -.0358 to -.00427. These changes are consistent with 
employer leaming/statistical discrimination. If we reverse the order in which the variables are added 
by adding AFQT*t before the training measures, the marginal effect of the training measures on 
educations is much smaller. We have also estimated separate models for blacks and for whites and 
obtain a similar pattern. 

In Columns 4-6 we investigate the effect of introducing the training measure on the race gap in 
wage slopes. The coefficient on black* t/ 10 declines from -.1467 to -.1048 when we add the training 
measures. Adding AFQT*t/10 leads to a further decline to -.0777. 

To reduce the difficulties associated with the lack of data on training in the early years of the 
study and individual heterogeneity that is correlated with both training and wages, we turn to a first 
differenced version of (24). In the first difference version the current and lagged values of T, enter. 
These results are in Table 9. The coefficient on education*t/10 declines from .0126 (.0094) to .0073 
when the training measures are added. The coefficient on Black rises from -.0995 (.0351) to -.0923 
(.0353). However, the coefficient on T t is positive while the coefficient on T,., is negative. These 
signs are inconsistent with a simple human capital model but are consistent with an EL-SD model in 
which training opportunities are given to more productive workers and learning about productivity 
occurs over time. Adding the training variables to a model that contains AFQT and F_ED has little 
impact on the coefficients on these variables. (Compare columns 2 and 4.) Imprecision in the training 
measures may partially explain this fact, but does not provide an explanation for the sign pattern in the 
training coefficients. The coefficients on s*t and Black*t decline in absolute value when AFQT and 
F_ED are added, as is predicted by the EL-SD. Overall, the wage change results are quite consistent 
with an important role for EL-SD 



particular college majors. 
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We view the evidence as consistent with a role for both human capital and EL-SD, but cannot 
make a precise statement about the relative contribution of these factors because, as discussed above, 
training will be influenced by new information about employee performance and the quality of the 
training data is suspect. 



7. Information Transmission Across Firms: 

The formal model that we have used to interpret the results assumes that employers have the 
same information about workers. The results suggest that information about productivity does 
eventually get reflected in wages. However, they do not identify whether these adjustments occur 
primarily in the current firm, presumably in response to outside pressure from competitors who have 
information about the worker, or through moves to other employers with associated wage increases for 
workers who do not move. 29 In this section we briefly examine the issue of information transmission 
across firms. 

A number of theoretical papers discuss whether information about productivity will be 
reflected in promotion paths and wage increases within firms, as well as the strategies firms might use 
to try to hide information about good workers (e.g. Greenwald (1986), Waldman (1984), Lazear 
(1986), Gibbons and Katz (1991)). 30 Unfortunately, the theory is ambiguous about whether a firm's 
private information concerning the worker will be reflected in wages offered by that firm to incumbent 
workers and about the mechanism that induces the firm to adjust wages. In some private information 
models in which only wages and perhaps position within the firm are observable to outside firms, the 
employer’s information is not reflected in wages until the worker gets an outside offer. In Waldman 
(1984) it is reflected in wages after the firm reassigns the worker to a position in which output is more 



29 Although we do not know of systematic evidence on this, casual empiricism suggests that changes in the legal system have 
led some firms to adopt the explicit policy of not providing references for former employees. Also, increased firing costs and 
concern about litigation may have made employers more reluctant to discharge workers for poor performance. Statistical 
discrimination may become a more serious problem if information flows are restricted. This may lead firms to relate 
compensation to performance more explicitly, with more turnover being a "voluntary" response to below average wage 
increases. On the other hand, difference in wages across groups may be attenuated because firms may be reluctant to open up 
large wage differentials between persons with similar education, seniority, and experience. It is possible that the balance 
between these two considerations has changed over time. 



30 None of this literature considers the implications of the possibility that employers and co-workers acquire reputations for how 
positive they are in promoting the careers of individuals or that the incentives of co-workers and even supervisors to keep 
favorable information about a colleague private or in concealing unfavorable information from associates outside the firm may 
be quite different from those of the employer. These factors would undermine the case that firms would want to and be able to 



keep inside information inside the firm. 
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sensitive to ability. In Gibbons and Katz it is reflected in wages if the firm chooses not to lay off the 
worker. The firm lays off low productivity workers, who are hired by other firms at lower wages. 

Outside firms infer that the remaining workers are of higher quality, which forces the employer to raise 
wages of those who stay with the firm. Both models have the implication that hard to observe 
variables like AFQT, F ED, and the wage of an older sibling should be positively related to wage 
growth if one does not condition on whether a person was laid off or not. This is what we found 
above. 

Gibbons and Katz (1991) provide empirical support for the hypothesis that layoffs should be 
negatively related to wage growth. But there are a number of other reasons why layoffs should be 
negatively related to wage growth (labor market conditions, lost seniority, for example). To obtain 
more focused tests, we interact personal characteristics that are hard for employers to observe directly 
with indicators for layoffs and discharges. The coefficients on these variables should differ from the 
coefficients on characteristics that affect productivity and are easy for employers to observe, such as 
years of schooling if (1) layoffs occur for multiple reasons, some of which have nothing to do with the 
worker, (2) the probability that a layoff reflects low worker specific productivity relative to the wage is 
related to z variables, and (3) outside employers have information about the nature of the layoff or 
obtain information (through references, for example) about productivity. 

This suggests an equation of the form 

w, - w t _, = p o + Layoff^, + zP 2 + z[LayoffJ p 3 + z[LayoffJt P 4 + other controls. 

If knowledge acquired by firms is reflected in wages, then P 2 should be nonzero, and P 3 and P 4 should 
be near zero. If knowledge acquired by firms is not reflected in wages, then P 2 should be small and P 3 
and P 4 should be nonzero. Given sample size limitations we have estimated a simplified version of the 
above equation on the sample of layoffs only, with z[Layoff t ]t excluded: 

w, - w m = (Po + Pi) + z P 3 + other controls. 

Our evidence on whether hard to observe variables such are positively associated with layoff losses is 
weak at best. In fact, we find that losses are larger for persons with high AFQT. We have not controlled 
for labor market conditions, and among the sample of layoffs they may be correlated with AFQT. 31 

31 We investigated whether the finding that wage losses rise with AFQT is driven by a positive correlation between AFQT and 
employment in a white collar, non union job, where layoffs are least likely to be influenced by seniority rules. Gibbons and 
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In Table 10 we report estimates of the effect of AFQT and F_ED on employer initiated 
separations. These include layoffs, firings, and plant closings. Our results were not very sensitive to 
distinguishing among these three types of job loss. We find that AFQT has a weak negative effect on 
the probability of losing one’s job, even after conditioning on seniority in the firm. However, when 
seniority is controlled for a swing of two standard deviations in AFQT changes this probability by .02, 
which is only 1/5 of the mean layoff rate of .1. We obtain similar results when the seniority control is 
dropped. 

Our results suggest that only a small part of the rise with t in the effect of AFQT on wages 
operates through an association between AFQT and layoffs and the wage losses experienced by those 
who are laid off. 

Lazear (1986) presents a model in which both the current firm and outside firms observe 
indicators of the productivity of the worker. His model predicts that workers with favorable 
productivity traits that are hard to observe directly will be more likely to receive outside offers and 
more likely to quit than workers whose hard to observe characteristics make them less productive. In 
results not reported we find that F_ED is positively related to the quit rate conditional on education 
and experience and tenure. AFQT does not have a significant effect. Neither AFQT nor F ED is 
significantly related to wage growth among those who quit. (Not reported). 

These results tentatively suggest that information flows in the labor market are sufficient to 
force a firm to differentiate among workers as the firm obtains better information about their 
productivity. A careful investigation will require a separate paper.. 

8. The Potential for Testing Services to Certify Skill 

Our estimates provide information about the rate at which employers leam about worker 
quality. In Altonji and Pierret (1996) we use our empirical estimates to explore the implications of the 
rate at which employers leam about worker quality for the empirical relevance of the educational 
screening hypothesis. We show that even if employers leam relatively slowly about the productivity 
of new workers, the portion of the return to education that could reflect signaling of ability is quite 
limited. While education may be too expensive to serve as a means for able workers to certify 



Katz note that layoffs are likely to be a particularly negative signal for white collar workers and restrict their analysis to them. 
However, splitting the leads to an even more negative coefficient for white collar workers than for blue collar workers. 



themselves to employers, perhaps other mechanisms could perform this function, at least for some 
determinants of productivity. Here we point out that interpreting our estimates of the time profile of 
the effect of AFQT on wages as the result of employer learning implies that high ability workers 
would have a substantial financial incentive to take the AFQT to differentiate themselves from those 
who are less able in this dimension. 

Suppose that a third party were to administer the AFQT and certify the results to outside 
employers, in much the same way that the Educational Testing Service administers the SAT exams. 
Using our estimates of the learning profile and assuming that firms know all of the information contained 
in AFQT by the time experience is 15 , we have computed how much a person who believes that he is 1 
standard deviation above the mean for the AFQT would pay to take the test at the time he enters the 
workforce. 32 The OLS estimates using potential experience underlying Table 6, panel A, column 3 ) 
imply that if firms become fully informed about productivity by the time experience is 15 and the interest 
rate is . 1 , then the person would be willing to pay .559 of the first year's salary for the test.33 The 
corresponding value when we use potential experience as an instrument for actual experience (panel B, 
column 3 ) is . 330 . 

These calculations raise the issue of why such a testing service has not emerged if information 
is initially imperfect. One answer is that firms are not aware that the AFQT captures characteristics 
that have a strong association with productivity. It is only recently, with the availability of the NLSY, 
that labor economists have become aware of this. Another is that it would be difficult for a testing 
firm to become established at a national level. A third is that, given race differences in the distribution 
of AFQT scores, firms who make use of AFQT information in hiring for a specific job would have the 
burden of establishing that they are relevant to productivity in that job or run the risk of violating 
discrimination laws. This would be true even if individuals provided firms with the test results. 

However, we do not find these answers to be fully satisfactory.34 Analyses based on variables such as 



32 if a worker did not know his ability, he could take a practice test on his own. Presumably, this would not raise the total cost 
of the test very much. 

33 Here we are assuming that only 1 worker takes the test and ignoring the fact that the composition of the pool of workers 
who choose to take the test in equilibrium would influence return for a particular type of worker. 

34 Note also that in the absence of an institution such as the Educational Testing Service, a firm might provide the test. Some 
firms perform their own testing.. However, if the results were available to the employees or other firms know that a particular 
firm tests its employees, then the firm would not be able to capture the full return to testing. 
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the wage rates of siblings or father's education may be less vulnerable to this objection. In any event, 
we should also point out that our estimates of the AFQT-experience profile are sensitive to treatment 
of time trends and training, so that financial return to being certified as high AFQT is probably 
substantially less than the above numbers imply. 

9. Conclusion 

This paper explores the implication of the premise that firms use the information they have 
available to them to form judgments or about the productivity of workers and then revise these beliefs as 
additional information becomes available. This a premise that seems natural to us and receives some 
strong empirical support in Farber and Gibbons (1996). If profit maximizing firms have limited 
information about the general productivity of new workers, then they may use easily observable 
characteristics such as years of education or race to statistically discriminate among workers. We show 
that as firms acquire more information about a worker, pay may become more dependent on productivity 
and less dependent on easily observable characteristics or credentials. This basic idea is quite general and 
provides a way to test for statistical discrimination in the labor market and elsewhere in situations in which 
agents leam, such as credit markets. 

We investigate it empirically by estimating a wage equation that contains interactions between 
experience and hard to observe characteristics such as AFQT and father's education along with the 
interaction between experience and a variable that firms can easily observe, such as years of education. 

We assume that all three variables are related to productivity. We find the wage effect of the 
unobservable productivity variables rise with time in the labor market and the wage effect of education 
falls. These results match the predictions of our model of statistical discrimination with learning. 

We use a similar methodology to investigate whether employers statistically discriminate on 
the basis of race. If our model is taken literally, the small race differentials for new workers and the 
spread in the race gap with experience is most consistent with the view that race is negatively 
correlated with productivity and the productivity gap becomes reflected in wages as firms acquire 
additional information that can legally be used to differentiate among workers. We wish to stress 
however, that other factors are probably as or more important in differences between whites and blacks 
in wage profiles, and race differences in human capital accumulation accounts for at least part of our 



findings. Also, our empirical results for race are sensitive to treatment of economy wide changes in 
the effects of race, AFQT, and education. Future research should also address the large race gap and 
education gap in employment rates, particularly for young workers. In situations in which there are 
alternatives to the conventional labor market and employees in the alternative sector do not acquire 
work histories that have value or are informative to firms in the conventional sector, then statistical 
discrimination of the type described above may reduce participation rates of the disadvantaged group 
in the conventional labor market. 

It is worth emphasizing that the analysis in the paper suggests alternative interpretations of 
empirical models of wages and other outcomes that involve experience interactions. It will be useful 
to re-examine the results of other studies that included interactions between experience and easy to 
observe variables such as schooling, race, gender, and experience in equations that also contain 
interactions between experience and harder to observe background measures. We have not been 
successful in sorting out the relative importance of differences among workers in training on one hand 
and statistical discrimination with learning on the other for our results. This is an important area for 
future research. 

An important and reasonably straightforward extension of the analysis is to other easily 
observable and hard to observable background characteristics. For example, do firms statistically 
discriminate on the basis of the neighborhood one is from or on the basis of the reputation of the high 
school, college, or graduate school one attends? A study of whether new immigrants are judged by the 
average skills of their countrymen in the U.S. would be a natural step in the research by Boijas (1992) 
and others documenting differences among immigrants in labor market success. These issues are 
researchable using the approach developed in this paper. Finally, it would be useful to apply the 
methods of the paper to other labor market outcomes in addition to wages. 
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Appendix 2: Derivation of Equation (16) and (17). 



Consider equation ( 15). Rewriting var(s,z)‘‘ as a partitioned matrix leads to 

var (s) co v(s,z)\ 

var (s,z )' = 

cov(z,s ) \ar(z)_ 

where var(s,z) is the (K+l)x(K+l) variance matrix. 

Using the partitioned inverse formula and ignoring the first column (since it will be multiplied by 0), 
we have: 

r u i l r 

' [cov(z, E( Av + e| D t ))] 
where G = [ varfi) * var (z)- co vfz, s) *cov(s, z)]~' 

Now, consider the diagonal matrix K which has elements of cov(z, Av+e) along the diagonal. K' 1 is 
also diagonal. Thus (15a) may be rewritten as: 

Tbs.1 fbsol r-cov(s,z)*G] 
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Manipulating this further gives us: 
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Table 3: IV Estimates of the Effects of Standardized AFQT, Father's Education, and Schooling on Wages 
Dependent Variable: Log Wage. Experience Measure: Actual Experience with Potential Experience as Instruments . 
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Table 4: Estimates of the Effects of Standardized AFQT, Father's Education, and Schooling on Wages 

Controlling for 2-digit Occupation Codes of Initial Job 
Dependent Variable: Log Wage. Experience Measure: Potential Experience. 

OLS estimates (standard errors) 
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Table 6: The Effects of Standardized AFQT, and Schooling on Wages Over Time 
Derivatives at Selected Experience Levels 
Dependent Variable: Log Wage 



A) Potential Experience. 



Years of 


5w t 


awf 


5w, 


awf 


Experience 


<3 AFQT 


a afqt, at 


5s 


5s, 5t 


0 


0.0197 

(0.0235) 


0.0025 

(0.0139) 


0.0786 

(0.0092) 


0.0053 

(0.0040) 
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0.0235 

(0.0275) 


0.0049 

(0.0144) 


0.0830 

(0.0101) 


0.0034 

(0.0040) 
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0.0370 

(0.0347) 


0.0084 

(0.0155) 


0.0865 

(0.0116) 


0.0002 

(0.0042) 
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0.0560 

(0.0415) 


0.0104 

(0.0166) 


0.0843 

(0.0131) 


-0.0023 

(0.0043) 
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0.0881 

(0.0512) 


0.0104 

(0.0181) 


0.0731 

(0.0152) 


-0.0048 

(0.0046) 


12 


0.1206 

(0.0640) 


0.0048 

(0.0201) 


0.0513 

(0.0179) 


-0.0056 

(0.0048) 



B) Actual Experience Instrumented with Potential Experience. 



Years of 


5w, 


5wf 


5w, 


5wf 


Experience 


aAFQT 


aAFQT, at 


5s 


5s, 5t 


0 


0.0183 

(0.0205) 


0.0092 

(0.0231) 


0.0881 

(0.0105) 


-0.0024 

(0.0086) 


1 


0.0278 

(0.0316) 


0.0099 

(0.0251) 


0.0843 

(0.0137) 


-0.0051 

(0.0089) 


3 


0.0496 

(0.0496) 


0.0120 

(0.0288) 


0.0711 

(0.0190) 


-0.0074 

(0.0096) 


5 


0.0755 

(0.0658) 


0.0138 

(0.0323) 


0.0566 

(0.0236) 


-0.0068 

(0.0102) 


8 


0.1172 

(0.0893) 


0.0131 

(0.0373) 


0.0406 

(0.0300) 


-0.0037 

(0.0112) 


12 


0.1475 

(0.1206) 


-0.0012 

(0.0436) 


0.0299 

(0.0381) 


-0.0032 

(0.0123) 



The equations the same variables as the equation in column (6) of table 1 except the interactions between 
education and experience and between AFQT and experience involve fourth-order polynomials in experience. In 
panel B, the instrumental variables are the corresponding terms involving potential experience and the other 
variables in the model. 
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Table 8: The Effects of Standardized AEQT, Schooling, and Training on Wages 
Dependent Variable: Log Wage Experience Measure: Potential Experience 

Training Measure: Predicted before 88, Actual After 

OLS estimates (standard errors) 
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Table 10: The Effects of Potential Experience, Standardized AFQT, Father's Education, and Schooling on the 

Probability of Employer-Initiated Separation 

Linear Probability Models 
Dependent Variable: Employer-Initiated Separation. 



OLS estimates (standard errors) 





Model: 


(i) 


(2) 


(a) 


Potential Experience / 10 


-0.0302 


-0.1646 






(0.0194) 


(0.0540) 


(b) 


Potential Experience 


-0.0143 


0.0148 




Squared / 100 


(0.0113) 


(0.0129) 


(c) 


Tenure 


-0.0141 


-0.0146 






(0.0006) 


(0.0006) 


(d) 


Education 


-0.0153 


-0.0206 






(0.0012) 


(0.0027) 


(e) 


Black 


0.0272 


0.0265 






(0.0057) 


(0.0057) 


(0 


Standardized 


-0.0108 


-0.0251 




AFQT 


(0.0029) 


(0.0061) 


(g) 


Father's 


0.0303 


0.0991 




Education / 100 


(0.0701) 


(0.1532) 


(h) 


Education * 




0.0083 




Experience / 10 




(0.0033) 


(0 


AFQT* 




0.0188 




Experience / 10 




(0.0065) 


( j) 


Father's Ed * 




-0.0910 




Experience /1000 




(0.1738) 



Note: An Employer-Initiated Separation includes separations because of layoffs, firings, and plant closings. All equations 
control for urban residence, and dummy variables to control for whether Father’s education is missing and whether AFQT 
is missing, and interactions between these dummy variables and experience when Experience interactions are included. 
Standard errors are White/Huber standard errors computed accounting for the fact that there are multiple observations for 
each worker. The sample size is 27443 observations from 4034 individuals. 
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Table Al: Descriptive Statistics 



Variable 


Mean 


Standard 

Deviation 


Minimum 


Maximum 


Real Hourly Wage 


8.370 


4.766 


2.01 


96.46 


Log of Real Hourly Wage 
(w) 


2.005 


0.474 


0.7 


4.57 


Potential Experience (t) 


7.349 


3.665 


0 


21 


Actual Experience (t) 


4.925 


3.424 


0 


18.26 


Education (s) 


12.699 


2.136 


8 


18 


Black dummy (Black) 


0.290 


0.454 


0 


1 


Dummy for not knowing 
AFQT Score 


0.038 


0.191 


0 


1 


Standardized AFQT Score 
(AFQT) 


-0.133 


1.022 


- 2.780 


1.922 


Dummy for not knowing 
Father's Education 


0.119 


0.324 


0 


1 


Father's Education (F ED) 


11.709 


3.112 


4 


20 


Dummy for Urban Dweller 


0.781 


0.413 


0 


1 


Year 


86.623 


81.558 


79 


92 


Training (Tj) 


0.096 


0.200 


0 


1 


Cumulative Training: (I T x ) 


0.462 


0.549 


0 


5.592 



Sample size = 27,704 observations except for the training measures where it is 25,115 observations. 
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Table A 2: Relationships Among Wages, Schooling, AFQT, and Parental Education Simple Regression 
Coefficients (standard error) and [Correlation coefficient] 







Dependent Variable 


Right Hand 
Side Variable 


Log Wage 


Highest 

Grade 


Father's 

Education 


Standard. 

AFQT 


Weeks of 
Company 
Training 


Layoff 


Quit 


Actual 

Experience 


Potential 

Experience 


Highest 

Grade 


0.0785 

(0.0014) 

[0.3615] 




0.6197 

(0.0098) 

[0.4029] 


0.2747 
(0. 0027) 
[0.5829] 


0.1189 

(0.0163) 

[0.0514] 


-0.0193 

(0.0010) 

[-0.1259] 


-0.0128 

(0.0014) 

[-0.0589] 


-0.0823 

(0.0103) 

[-0.0329] 


-0.4831 

(0.0106) 

[-0.2923] 


Father’s 

Education 


0.0298 

(0.0010) 

[0.2092] 


0.2592 

(0.0041) 

[0.4029] 




0.1341 

(0.0019) 

[0.4362] 


0.0621 

(0.0106) 

[0.0392] 


-0.0059 

(0.0007) 

[-0.0542] 


0.0014 

(0.0009) 

[0.0112] 


-0.0323 

(0.0067) 

[-0.0331] 


-0.1660 

(0.0071) 

[-0.1538] 


Standardized 

AFQT 


0.1565 

(0.0031) 

[0.3567] 


1.2245 

(0.0119) 

[0.5829] 


1.4280 

(0.0204) 

[0.4362] 




0.3072 

(0.0345) 

[0.0645] 


-0.0377 

(0.0021) 

[-0.1174] 


-0.0121 

(0.0029) 

[-0.0306] 


-0.1036 

(0.0218) 

[-0.0142] 


-0.8138 

(0.0227) 

[-0.2329] 


Weeks of 
Company 
Training 


0.0045 

(0.0007) 

[0.0429] 


0.0214 

(0.0029) 

[0.0514] 


0.0268 

(0.0046) 

[0.0392] 


0.0124 

(0.0014) 

[0.0645] 




-0.0011 

(0.0004) 

[-0.0130] 


-0.0017 

(0.0006) 

[-0.0190] 


-0.0173 

(0.0044) 

[-0.0282] 


-0.0297 

(0.0047) 

[-0.0453] 


Layoff 


-0.1659 

(0.0104) 

[-0.1094] 


-0.8921 

(0.0468) 

[-0.1259] 


-0.6558 

(0.0728) 

[-0.0542] 


-0.3904 

(0.0222) 

[-0.1174] 


-0.2702 

(0.1112) 

[-0.0130] 




-0.2707 

(0.0092) 

[-0.2080] 


-1.1766 

(0.0696) 

[-0.1391] 


-0.3223 

(0.0753) 

[-0.0629] 


Quit 


-0.2145 

(0.0076) 

[-0.1909] 


-0.3232 

(0.0348) 

[-0.0589] 


0.0814 

(0.0539) 

[0.0112] 


-0.0683 

(0.0165) 

[-0.0306] 


-0.2321 

(0.0821) 

[-0.0190] 


-0.1478 

(0.0050) 

[-0.2080] 




-1.2747 

(0.0510) 

[-0.1658] 


-0.8834 

(0.0553) 

[-0.1070] 


Actual 

Experience 


0.0444 

(0.0010) 

[0.2893] 


-0.0374 

(0.0047) 

[-0.0329] 


-0.0350 

(0.0072) 

[-0.0331] 


-0.0106 

(0.0022) 

[-0.0142] 


-0.0436 

(0.0110) 

[-0.0282] 


-0.0116 

(0.0007) 

[-0.1391] 


-0.0230 

(0.0009) 

[-0.1658] 




0.8605 

(0.0045) 

[0.7953] 


Potential 

Experience 


0.0174 

(0.0010) 

[0.1044] 


-0.1899 

(0.0042) 

[-0.2923] 


-0.1561 

(0.0066) 

[-0.1538] 


-0.0718 
(0.0020) 
[-0. 2329] 


-0.0647 

(0.0103) 

[-0.0453] 


-0.0027 

(0.0006) 

[-0.0629] 


-0.0138 

(0.0009) 

[-0.1070] 


0.7452 

(0.0039) 

[0.7953] 
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